LASSAP, a LArge Scale Sequence compArison Package
نویسندگان
چکیده
MOTIVATION This paper presents LASSAP, a new software package for sequence comparison. LASSAP is a programmable, high-performance system designed to raise current limitations of sequence comparison programs in order to fit the needs of large-scale analysis. LASSAP provides an API (Application Programming Interface) allowing the integration of any generic pairwise-based algorithm. RESULTS Whatever pairwise algorithm is used in LASSAP, it shares with all other algorithms numerous enhancements such as: (i) intra- and inter-databank comparisons; (ii) computational requests (selections and computations are achieved on the fly); (iii) frame translations on queries and databanks; (iv) structured results allowing easy and powerful post-analysis; (v) performance improvements by parallelization and the driving of specialized hardware. LASSAP currently implements all major sequence comparison algorithms (Fasta, Blast, Smith/Waterman), and other string matching and pattern matching algorithms. LASSAP is both an integrated software for end-users and a framework allowing the integration and the combination of new algorithms. LASSAP is used in different projects such as the building of PRODOM, the exhaustive comparison of yeast sequences, and the subfragments matching problem of TREMBL.
منابع مشابه
LASSAP, a LArge Scale Sequence compAhson Package
Motivation: This paper presents LASSAP, a new software package for sequence comparison. LASSAP is a programmable, high-performance system designed to raise current limitations of sequence comparison programs in order to fit the needs of large-scale analysis. LASSAP provides an API (Application Programming Interface) allowing the integration of any generic pairwise-based algorithm. Results: What...
متن کاملUsing video-oriented instructions to speed up sequence comparison
MOTIVATION This document presents an implementation of the well-known Smith-Waterman algorithm for comparison of proteic and nucleic sequences, using specialized video instructions. These instructions, SIMD-like in their design, make possible parallelization of the algorithm at the instruction level. RESULTS Benchmarks on an ULTRA SPARC running at 167 MHz show a speed-up factor of two compare...
متن کاملNMscatt: a program for calculating inelastic scattering from large biomolecular systems using classical force-field simulations
Computational tools for normal mode analysis, which are widely used in physics and materials science problems, are designed here in a single package called NMscatt (Normal Modes & scattering) that allows arbitrarily large systems to be handled. The package allows inelastic neutron and X-ray scattering observables to be calculated, allowing comparison with experimental data produced at large sca...
متن کاملTasuke: a web-based visualization program for large-scale resequencing data
SUMMARY Because an enormous amount of sequence data is being collected, a method to effectively display sequence variation information is urgently needed. tasuke is a web application that visualizes large-scale resequencing data generated by next-generation sequencing technologies and is suitable for rapid data release to the public on the web. The variation and read depths of multiple genomes,...
متن کاملSecurity-Constrained Unit Commitment Considering Large-Scale Compressed Air Energy Storage (CAES) Integrated With Wind Power Generation
Environmental concerns and depletion of nonrenewable resources has made great interest towards renewable energy resources. Cleanness and high potential are factors that caused fast growth of wind energy. However, the stochastic nature of wind energy makes the presence of energy storage systems (ESS) in wind integrated power systems, inevitable. Due to capability of being used in large-scale sys...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Computer applications in the biosciences : CABIOS
دوره 13 2 شماره
صفحات -
تاریخ انتشار 1997